Local Unitary Quantum Cellular Automata 
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In this paper we present a quantization of Cellular Automata. Our formalism is based on a lattice of audits, 
and an update rule consisting of local unitary operators that commute with their own lattice translations. One 
purpose of this model is to act as a theoretical model of quantum computation, similar to the quantum circuit 
model. It is also shown to be an appropriate abstraction for space-homogeneous quantum phenomena, such as 
quantum lattice gases, spin chains and others. Some results that show the benefits of basing the model on local 
unitary operators are shown: universality, strong connections to the circuit model, simple implementation on 
quantum hardware, and a wealth of applications. 
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I. INTRODUCTION 

The Cellular Automaton (CA) is a computational model 
that has been studied for many decades |Q1 0]. It is a sim- 
ple yet powerful model of computation that has been shown 
to be Turing complete [2]. It is based on massive parallelism 
and simple, locally constrained instructions, making it ideal 
for various applications. In particular, CA are very effective 
at simulating many classical physical systems, including gas 
dispersion, fluids dynamics, ice formation, and even biolog- 
ical colony growth (3J. Although usually simulated in soft- 
ware, CA hardware implementations have also been devel- 
oped. All of these characteristics make CA a strong tool for 
moving from a physical system in nature, to a mathematical 
model, to an implemented physical simulation. 

More recently, the idea of Quantum Cellular Automata 
(QCA) has emerged. Several theoretical mathematical mod- 
els have been proposed IJ, S la III IMl • However, there is a 
lack of applications developed within these models. On the 
other hand, ad hoc models for specific applications like quan- 
tum lattice gases lH [T^l, among others ifTTIl . have been de- 
veloped. Several proposals for scalable quantum computation 
(QC) have been developed that use ideas and tools related to 
QCA JH m M El 161. Some of these have been shown 
to be capable of universal computation fl7l[T8ll . Other QCA 
tools have been used to solve, or propose solutions to, partic- 
ular problems in physics [19, 20, 2 U UA 12311 . 

However, there does not exist a comprehensive model of 
QCA that encompasses these different views and techniques. 
Rather, each set of authors defines QCA in their own partic- 
ular fashion. In short, there is a lack of a generally accepted 
QCA model that has all the attributes of the CA model men- 
tioned above: simple to describe; computationally powerful 
and expressive; efficiently implemented in quantum software 
and hardware; and able to efficiently and effectively model 
appropriate physical phenomena. 

The purpose of this paper is to propose such a model. 
The model we present here is based on intuitive and well- 
established ideas: qudits as the basic building blocks (cells), 
and local unitary operators as the basic evolution method (lo- 
cal update rule). 

The choice of local unitary operators as the basic evolu- 
tion operator ensures that the model is simple and easily ex- 



plained to anyone familiar with the field of quantum infor- 
mation. However, the choice is not made merely for sake of 
simplicity: it provides us with an efficient implementation of 
QCA on quantum hardware, while still enjoying an expressive 
richness strong enough to simulate any appropriate physical 
system. 

Formally, what we mean by efficient implementation is that 
there exists a uniform family of quantum circuits that can 
each simulate the evolution of a finite region of the QCA, 
for a specified number of steps. Furthermore, we require that 
the depth of each circuit be strictly linear in the number of 
steps, and constant on the size of the region being simulated. 
This last requirement is to ensure that the QCA retains the 
quintessential quality of CA: massive parallelism. 

We will refer to this formalization as the Local Unitary 
Quantum Cellular Automata (LUQCA) model, when we need 
to make the distinction from other formal definitions of QCA. 

In Section [IV] we will see how any QCA properly defined 
in the model presented here can be efficiently implemented. 
The fact that there is such a guarantee, without any further 
restraints, is one of the strongest features of the model herein 
presented. In Section [VIII| we will see that in general, previous 
models cannot make such a guarantee. We will also discuss 
what methods can be used to translate QCA in these models 
into the LUQCA model. 

We will see in Sections [TVl and W\ how insisting on efficient 
implementations does not at all limit the expressive power of 
our QCA model. SectionlVlwill also show how most, if not all, 
physical systems of interest with the proper characteristics — 
time and space homogeneity — can be modeled using local 
unitary QCA. We will also prove computational completeness 
in section [IV] Section IVIIII discusses how valid QCA pre- 
sented in other models can be rephrased in the local unitary 
QCA scheme. 

We begin in Section [LI] by briefly describing classical CA 
in detail. Following that, we will endeavor to quantize this 
model in the most natural way possible. The rest of the paper 
presents results pertaining to the strengths of this model. 



II. CELLULAR AUTOMATA 

In the classical model of cellular automata, we begin with 
a finite set of states £ and an infinite lattice of cells, each of 
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which is in one of the states in S. We have discrete time steps, 
and at each time step t, the state of the lattice evolves accord- 
ing to some rule. This rule gives the state of each cell at time 
t + 1 as a function of the states of the cells in its neighbor- 
hood, which is simply a finite set of cells corresponding to a 
particular cell. 

Definition 1 (CA). A Cellular Automaton is a 4-tuple 
(L, X, Af, f) consisting of a d- dimensional lattice of cells in- 
dexed by integers, L = Z d , a finite set X of cell states, a finite 
neighborhood scheme M C Z d , and a local transition func- 
tion f : -> S. 

The transition function / simply takes, for each lattice cell 
position x e L, the states of the neighbors of x, which are 
the cells indexed by the set x + Af at the current time step 
t € Z to determine the state of cell x at time t + 1. There 
are two important properties of cellular automata that should 
be noted. First, cellular automata are space-homogeneous, in 
that the local transition function performs the same function at 
each cell. Also, cellular automata are time-homogeneous, in 
that the local transition function does not depend on the time 
step t. 

We may also view the transition function as one which acts 
on the entire lattice, rather than on individual cells. In this 
view, we denote the state of the entire CA as a configuration 
C € S L which gives the state of each individual cell. This 
gives us a global transition function which is simply a function 
that maps F : T, L — > S L . 



A. Reversible and Partitioned CA 

As a first step towards developing a theory of unitary CA 
we will revisit the theory of classical reversible automata. 

A CA is reversible if for any configuration C E S L , and 
time step i e Z there exists a unique predecessor configura- 
tion C such that C = F(C, t). It is known that any Turing 
machine can be simulated using a reversible CA [24], so no 
computational power is lost by this restriction. 

One method that is used to construct reversible cellular au- 
tomata is partitioning. In a partitioned CA, the transition func- 
tion is composed of local, reversible operations on individual 
units of a partition of the lattice. 

In order to formally define partitioned CA, we must expand 
the definition of cellular automata, as partitioned CA are nei- 
ther time-homogeneous nor space-homogeneous in general. 
They are, however, periodic in both space and time, and thus 
we set both a time period T > 1 and a space period, given 
as a d-dimensional sublattice S of L = Z d . The sublattice S 
can be defined using a set {v k ■ k — 1, . . . , d} of d linearly 
independent vectors from L = Z d as: 
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FIG. 1: Partitioned Cellular Automaton 



(B + S2) = 0/or any Si, S2 € S with s± ^ S2, and such that 



[}{B + s) = r 



Definition 2. For a given fixed sublattice S C Z d , we define 
a block B C 1 d as a finite subset of1 d such that {B + s%) (1 



The main idea of the partitioned CA is that at different time 
steps, we act on a different block partition of the lattice. We 
are now ready to formally define the partitioned CA. 

Definition 3. A Partitioned CA is a 6-tuple (L, S, T, S, B, J 7 ) 

consisting of 

1. a d-dimensional lattice of cells indexed by integers, L = 

7L d ; 

2. a d-dimensional sublattice S C L; 

3. a time period T > 1; 

4. a finite set S of cell states; 

5. a block scheme B, which is a sequence 
{Bq, Bi, . . . , Bt-i} consisting of T blocks rela- 
tive to the sublattice S; and 

6. a local transition function scheme T, which is a set 
{/o,/i, . . . j/t— 1} of reversible local transition func- 
tions which map f t : X s * — ► T, Bt . 

At time step t + kT for < t < T and k £ Z, we perform 
ft on every block Bt + s, where s 6 S. In order to find the 
reverse of a partitioned CA, we simply give the reverse block 
scheme, B = {Bt-i, ■ ■ ■ , B\, Bo}, and the reverse function 
scheme, T = {/ T l l5 . . ., /f 1 ,/^ 1 }- 

Although the partitioned CA is not time- or space- 
homogeneous, it can be converted into a regular CA, on the 
lattice S (which is isomorphic to Z d ), with cell states T, B , 
where the new local transition function simulates T time steps 
of the partitioned CA in one time step. 

In the original partitioned CA scheme as described by 
Margolus [25], the sublattice was fixed as S = 2Z d , and 
the block scheme was fixed with two partitions: B = 
{{xi,x 2 ■ ■ ■ ,x d ) :0<Xj< l}and_Bi = {(xi, x 2 ■ ■ ■ ,Xd) ■ 
l<Xj< 2}. 
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III. LOCAL UNITARY QCA 

Now, with a formal notion of CA, we can proceed to give a 
quantization. As mentioned earlier, we will have very specific 
goals in mind. 



A. Model Requirements 

First, we want to develop an intuitive model that is both 
simple to work with, and to develop algorithms for. At the 
same time, we want this model to be an obvious extension 
of classical CA and to reduce to classical CA behavior under 
reasonable limits. 

Second, we want to keep our model grounded in physical 
realities. This has a couple of strong consequences. Namely, 
we approach CA, even classical CA, not as abstract mathe- 
matical structures, but as models representing real physical 
systems. As a consequence, we expect our model to reliably 
model quantum systems with appropriate behavior, e.g., spin 
chains. Also, an algorithm described in our model should be 
easy to translate to an actual physical implementation on such 
quantum systems. We show in Section lVTl that this is so. 



B. A First Approach 

The first step in our quantization of CA is to change the 
state space of a cell to reflect a quantum system. There are 
several methods for doing so, however we believe that the 
most natural way to approach this is to convert the alphabet 
of the cellular automaton, S, into orthogonal basis states of 
a Hilbert space, H^. Formally, every cell i £ lis assigned 
a qudit, \x) G TCs- This gives us a strong intuitive tool, as 
the notion of a lattice of qudits should be familiar to anyone 
working in quantum information theory. 

As we shall see, our approach is also physically grounded, 
in that it is possible to describe this model in terms of a quan- 
tum system evolving according to a Hamiltonian. As an ex- 
ample, spin chains can be directly described by such math- 
ematical constructions. Lattice gases, though not originally 
modeled in this way, can also be easily described by such 
mathematical constructs. Perhaps the most obvious physical 
example is the pulse-driven quantum computer. 

We also wish to quantize the standard classical CA update 
rule. However, this process cannot necessarily proceed in the 
most obvious manner. In a classical CA, every cell is instan- 
taneously updated in parallel. We wish to replace this classi- 
cal cell update rule with a quantum analogue that acts appro- 
priately on the qudit lattice described above. For a quantum 
unitary operation to act as a quantum cell update rule, this 
operator needs to fulfill the following two restrictions: 

1. The operator must act on a finite subset of the lattice. 
Precisely U x : H{M X ) -> H(N X ) where Af x = Af + 
1 C Lis the finite neighborhood about the cell x. 



2. The operator must commute with lattice translations of 
itself. Precisely, we require that [U x , U y ] = for all 

x,y£Z n . 

The first condition is an immediate condition for any rule, 
quantum or otherwise, to qualify as a CA update rule. The 
second condition allows the operators U x for i £ Z" to be 
applied in parallel without the need to consider any special or 
particular ordering of the operators. 

It should be clear that any evolution defined in such manner 
represents a valid quantum evolution which can be ascribed to 
some physical system. The global evolution of the lattice can 
be described as 

u = l[u x , 

X 

which is well-defined, due to the two conditions given above. 

The question that remains is whether this model properly 
describes what we intuitively would regard as QCA. Properly, 
there are two questions: 

1 . Can all entities described by the model above be prop- 
erly classified as QCA? 

2. Can all systems that are identified as QCA be properly 
described in the model above? 

The answer to the first question is yes, since the update rules 
are local and can be applied in unison throughout the lattice. 
Also, the global unitary operator for the evolution of the lattice 
is properly defined and space-homogeneous, as desired. 

The answer to the second question is, unfortunately, no. We 
now present a simple system that one might consider to be a 
valid QCA, but cannot be described in the above model. 

The counterexample is as follows. We start with a one- 
dimensional lattice of qudits. For each lattice cell x G L, 
we associate with it a quantum state \ip x ) G H^. Although in 
general, the configuration of a QCA may not be separable with 
respect to each cell, the configuration can still be described in 
terms of a linear superposition of these separable configura- 
tions. Thus, it suffices to consider such configurations. 

At each time step we wish to have every value shifted one 
cell to the right. In other words, after the first update each cell 
x should now store the state \4> x -i). After k steps each cell 
x should contain the state \ip x -k)- In fact, such a transition 
function cannot be implemented by any local unitary process. 

To see why this is so, suppose that we had a transition func- 
tion /, which is the product of a finite number of operations, 
/ = /n/n-i ■ ■ ■ fi, sucn that each operator fj is the (poten- 
tially infinite) product of local unitary operators over disjoint 
neighborhoods. Note that this gives us the most general de- 
scription possible of a depth-n quantum circuit implementa- 
tion of this linear QCA using only local unitary operators. 
Now, consider an individual cell, xo. By analyzing the depen- 
dencies of the individual local unitary operators which make 
up the transition function /, it is possible to find a range of 
cells, P = {x : a < x < b} for some a, b G Z such that 
xq G P, and the value of the quantum state at cell xq after 
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the application of the transition function depends only on the 
cells of P. 

We now divide the transition function / into two functions, 
/ = hg, where g applies sufficiently many local unitary oper- 
ators from / over the cells of P so that the new value of the 
quantum state at cell xq is computed, without violating any 
of the dependency relations from /. Then, h simply applies 
the remainder of the local unitary operators, as appropriate. 
Note that since g necessarily contains any local unitary opera- 
tors from / which operate on the cell x , the operation h does 
not. Since h does not perform any operation between cells 
x < x and cells x > x , in order to implement the shift-right 
transition function, the cells {x : x < x < b} must contain 
enough quantum information after g has been applied to re- 
construct the information in the cells {x : xq < x < b}. This 
is clearly impossible. 

In order to resolve this issue, we need to analyze the classi- 
cal CA parallel update rules more closely. In the classical CA, 
the local update rule for a given cell reads the value of the cell, 
and the values of its neighboring cells. It performs a compu- 
tation based on these values, and then updates the cell's value 
accordingly. Herein lies the problem: read and update are 
modeled in a classical CA as a single atomic action that can 
be applied throughout the lattice in parallel simultaneously. 
However, in a physical setting, these two operations cannot 
be implemented in this manner. When simulating CA in clas- 
sical computer architectures, the canonical solution is to use 
two lattices in memory: one to store the current value, and 
one to store the computed updated value. Even if we consider 
hardware implementations of CA, these need to keep the val- 
ues of the inputs to the transition function while this function 
is being calculated. 

The formal CA model does not need to consider this im- 
plementation detail, as it is a mathematical construction and 
has no claims to directly model a physical system implement- 
ing a CA. When developing a QCA model, one cannot take 
the same liberty. The name itself, QCA, includes reference 
to an underlying quantum physical reality. It is our intention 
that this model faithfully, if abstractly, represents real phys- 
ical systems. Although there is some value in mathematical 
constructions which do not correspond directly to any physi- 
cal systems, this is not the goal of the constructions presented 
in this paper. 



C. A New Approach 

We now make an adjustment to our QCA model, given the 
importance of maintaining independent read and update op- 
erations. Instead of having one unitary operator replacing the 
single atomic operation in the CA model, we define our QCA 
update rule as consisting of two unitary operators. The first 
operator, corresponding to the read operation, will be as de- 
fined above: a unitary operator U x , x E L acting on the neigh- 
borhood Af x , which commutes with all lattice translations of 
itself, Uy, y e L. The second operator, V x , x e L, corre- 
sponds to the update operation, and will only act on the single 
cell x itself. 



The intuition is as follows: in our physical model, instead of 
having separate lattices for the read and update functions, we 
expand each lattice cell to also contain any space resources 
necessary for computing the updated value of the cell. The 
operator U x reads the values of the neighborhood Af x , per- 
forms a computation, and stores the new value in such a way 
that does not prevent neighboring operators U y from correctly 
reading its own input values. This allows each cell to be oper- 
ated upon independently, in parallel, without any underlying 
assumptions of synchronization. After all the operations U x 
have been performed, the second unitary V x performs the ac- 
tual update of the lattice cell. 

With this new model for the update operation, we can again 
approach the two questions given above as to whether this 
model adequately describes what we might intuitively regard 
as QCA. 

First, it is clear that all entities described by this updated 
model can still be properly classified as QCA. The local up- 
date rule R x = V X U X is still a valid quantum unitary opera- 
tion, and the global update rule 



R = VU = 




is both well-defined and space-homogeneous. 

Now, in order to properly investigate whether all physical 
systems which can be described as QCA can be described 
within this new model, it is necessary to verify the following: 

We must first compare our model to existing CA models, 
both classical and quantum, in order to ensure that our model 
subsumes all proper CA described in these models. Secondly, 
we must also show that any known physical system which be- 
haves according to quantum mechanics and satisfies the CA 
preconditions of being driven by a local, space-homogeneous 
interaction can be described by our model. 

As an example, the qubit shift-right QCA mentioned above 
can now be described in this model, by including ancillary 
computation space with each lattice cell. 

We will tackle this question in more depth in the upcoming 
sections. First, we present a formal definition of the QCA 
model which we will adopt, as described in this section. 

Definition 4 (QCA). A Quantum Cellular Automaton is a 5- 
tuple (L, S,7V, Uq, Vo) consisting of 

1. a d-dimensional lattice of cells indexed by integers, L = 

Z d , 

2. a finite set E of orthogonal basis states, 

3. a finite neighborhood scheme N C 7L d , 

4. a local read function Uq : (7Ys) 0A ^ — > and 

5. a local update function Vq : riy, — > We. 

The read operation carries the further restriction that any two 
lattice translations U x and U y must commute for allx,y G L. 
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Each cell has a finite Hilbert space associated with it 7Ys = 
span ({|x)}a; e s)- The reduced state of each cell p x is a density 
operator over this Hilbert space. 

The initial state of the QCA is defined in the following 
way. Let / be any computable function that maps lattice vec- 
tors to pure quantum states in (7is) 8>/c . where d is the di- 
mension of the QCA lattice, and k is the length of a side of 
a d-dimensional hypercube, which we use to define blocks 
that are initialized to particular states. Then for any lattice 
vector z = {z\k, z%k, . . . , z^k) G 7L d the initial state of 
the lattice hypercube delimited by {z\k, z 2 k, . . . , zjk) and 
({z 1 + l)k - 1, (z 2 + l)k — 1, . . . , (za + l)k - 1) is set to 
/(*)• 

Intuitively, each block represents a volume of the QCA that 
is initialized to a particular pure state. Each block is initial- 
ized independently. In particular, / can have a block size of 
one cell, initializing every cell to the same state in E. It can 
also have more complicated forms such as having every pair 
of cells in a one dimensional QCA initialized to some maxi- 
mally entangled state. Particularly useful are functions / that 
initialize a finite region about the origin to some interesting 
state — the input of the QCA — and the rest of the lattice to 
some quiescent state (see below). 

The local update rule acting on a cell x consists of the op- 
eration U x followed by the single-cell operation V x . Both U x 
and V x are restricted to being computable unitary operators. 
The global evolution operator R is as previously defined. 



D. Quiescent States 

Our QCA definition follows the classical CA convention in 
defining the model over an infinite lattice. However, we will 
often be concerned only with finite regions of the QCA. One 
reason, for example, is that any physical implementation of 
a QCA using quantum hardware will, by necessity, simulate 
only a finite region of the QCA. Another reason is for simulat- 
ing physical phenomena. For instance, in Section [Vl we will 
be interested in simulating finite size chains of spin-i parti- 
cles. 

Sometimes, it can be appropriate to simply use finite QCA 
with cyclic boundary conditions. In this case, we envision the 
lattice as a closed torus. This is a standard and well-known 
practice with CA. For example, we can use this technique 
if the spin chain we wish to simulate is closed, that is, it it- 
self wraps around. For other applications, this will not be ap- 
propriate, for example, when trying to simulate an open spin 
chain. This is a chain which does not wrap around, but rather 
has two distinct end points. Another example will be the spin- 
signal amplification algorithm in Section I VIII which uses a 
finite size cube ancilla system. 

In such cases, the most appropriate way to proceed is to 
make use of a quiescent state, which is a cell state that is guar- 
anteed to remain invariant under the update rule, regardless 
of the states of its neighbors. For instance, in the case of the 
finite spin-i chains, we can use three state cells. We use the 
state labels |+1) and | — 1) to refer to the presence of a spin-i 
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FIG. 2: Past lightcone of a region S: This represents a one- 
dimensional local unitary QCA. In order to obtain the state of the 
region of interest, the dark region at the bottom, one must consider 
not just the region itself, but anything that might affect the state of 
the region with the course of the simulation: its past lightcone. One 
may then trace out the unneeded regions. 

particle in a given cell position in the states i(l + a z ) and 
|(1 — a z ) respectively. A third state, labeled |0) denotes the 
absence of any particle in that cell location. One then need 
only ensure that the update rule correctly acts on states |+1) 
and | — 1), while leaving state |0) unaffected. 

Quiescent states are also very useful for the purposes of 
simulation, and physical implementation. Normally, if one is 
interested in the state of a region S of the lattice after k steps 
of the QCA update rule, one would need to look at the past 
lightcone of S. If the local update rule has a neighborhood 
of radius r, then one needs to include kr additional cells in 
each direction beyond the border of S. This is because any 
information in the past lightcone of S has the ability to affect 
cells within S, as shown in Figure [2] Note that since the size 
of the region needed by the simulation is determined by the 
number of time steps of the QCA we wish to simulate, one 
needs to fix the number of steps in the simulation beforehand. 
However, if a given QCA has a quiescent state, and all cells 
outside the finite region under consideration are initialized to 
this quiescent state, then the simulation of this QCA need only 
include this region for any number of simulated time steps. 



IV. QUANTUM CIRCUITS AND UNIVERSALITY 

In this section we explore two important aspects of the QCA 
model we introduced in Section [III] These aspects relate to 
QCA as a model of computation. First, it is important to show 
that QCA are capable of universal quantum computation. We 
demonstrate this using a simulation of an arbitrary quantum 
circuit using a two-dimensional QCA. 

We also show that any QCA can be simulated using families 
of quantum circuits. A quantum circuit is defined as a finite set 
of gates acting on a finite input. One can then define a uniform 
family of quantum circuits, with parameters S and t, such that 
each circuit simulates the finite region S of the QCA for t 
update steps. By uniformity we mean that that there exists an 
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FIG. 3: Quantum circuit simulation of a QCA update step: The dot- 
ted area represents the read phase. A read operator U must be ap- 
plied to each qubit, and its two neighbors. Since U commutes with 
its translations, we are at liberty to apply the U operators in any or- 
der. The update phase consists of the operator V being applied to 
every qubit. 



effective procedure, such as a Turing machine, that on input 
(S, t) outputs the correct circuit. 

We will show that our simulation is efficient, as defined in 
SectionU Specifically, in order to simulate a QCA on a given 
region, for a fixed number of time steps, we give a quantum 
circuit simulation with a depth which is linear with respect to 
the number of time steps, and constant with respect to the size 
of the simulated region. 



A. Simulation of QCA by Quantum Circuits 

We begin by showing the latter of the two results described 
above. We proceed incrementally, showing first how to pro- 
duce a quantum circuit that can simulate a single update step 
of a simple QCA. 

Lemma 1. Any finite region of a one-dimensional QCA with 
a symmetric neighborhood of radius one, where cells are in- 
dividual qubits, can be simulated by a quantum circuit. 

Proof. The simulation of an individual update step of this 
QCA is simple. Recall that the operators U x , each acting on 
3 qubits, all commute with each other. Therefore, the U x op- 
erators may be applied in an arbitrary order. The operators V x 
can all be applied to their respective qubits once all U x opera- 
tors have been applied. Figure[3]gives a visual representation 
of this construction. In order to simulate an arbitrary number 
of steps, we simply need to repeatedly apply the above con- 
struction. Finally, although we represented the operators U in 
our diagram as single, three-qubit operators, to complete the 
simulation we decompose U into an appropriate series of one 
and two qubit gates from a universal gate set. □ 

In the case of one-dimensional QCA with a nearest neigh- 
bor scheme, and cells consisting of one qubit, the operator U 
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FIG. 4: Decomposition of general audit gates. 



is simply acting on three qubits. Still this operator U needs 
to be decomposed into a series of one and two qubit gates 
U = U n U n -i . . . U2U1 taken from a set of universal gates. 

In order to extend the construction to allow cells with qudits 
of arbitrary dimension d, we first replace the single qubit wires 
in Figure[3]with qudit wires as in Figure|4] Then each gate U x 
and V x are decomposed into one and two qubit gates as in 
the aforementioned figure. The same construction technique 
works in order to deal with arbitrary dimensions, and arbitrary 
cell neighborhood sizes. 

Note that m and n are constants, determined by the struc- 
ture of the QCA. For very complicated QCA these constants 
can be potentially large. However, once the QCA has been 
defined these parameters are set, and hence do not asymptoti- 
cally affect the complexity of simulating a region of the QCA 
for a particular length of time. 

As our simulation above does not set a region size to be 
simulated, any region size can be simulated with an appropri- 
ate construction. An arbitrary number of time steps can be 
simulated by simply iterating the above construction. With 
this in mind, as well as the previous lemma, we can now state 
the following: 

Theorem 1. For every QCA Q there exists a family of 
quantum circuits, parameterized by (S, t), each acting on 
0{m log |S|) inputs, and with circuit depth 0(t) which simu- 
lates a finite region of S of Q consisting of m — \S\ cells, for 
t time steps 

This is a very important result, as it demonstrates that the 
local unitary QCA model does not admit automata which are 
somehow "not physical". More precisely, any behavior that 
can be described by a QCA can be described by the more tra- 
ditional quantum circuit model. Furthermore, such descrip- 
tions retain the high parallelism inherent to QCA. 



B. Simulation of Quantum Circuits by QCA 

Next, we show the converse result from the one above, thus 
showing that local unitary QCA are capable of efficient uni- 
versal quantum computation. 

Theorem 2. There exists a universal QCA Q u that can sim- 
ulate any quantum circuit with at most a linear slowdown, by 
using an appropriately encoded initial state. 

Proof. We proceed by constructing the QCA Q u over a two- 
dimensional lattice. We will basically 'draw ' the circuit onto 
the lattice. The qubits will be arranged top to bottom, and the 
wires will be visualized as going from left to right. 
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Each cell will consist of a number of fields, or registers. The 
cell itself can be thought of as the tensor product of quantum 
systems corresponding to these registers. 

The first register, the state register, consists of a single qubit 
which corresponds directly to the value on one of the wires of 
the quantum circuit at a particular point in the computation. 
This value will be shifted towards the right as time moves for- 
ward. 

Next is the gate register. This register will be initialized to a 
value corresponding to a gate that is to be applied to the state 
register, at the appropriate time. 

There is also a clock register, which will keep track of the 
current time step of the simulation. There are two phases to 
the simulation, an 'operate' and a 'carry' step. 

There is finally a single qubit active register, that keeps a 
record of which cells are currently actively involved in the 
computation. This register is either set to true ox false. 

The local read operator U x proceeds as follows. The neigh- 
borhood scheme is the von Neumann neighborhood of radius 
one, i.e. the cells directly above, below and to either side of 
the cell. The read operator acts non-trivially only on the one 
cell directly above, and the one directly to the left. However, 
the bigger neighborhood is needed to ensure unitary evolution, 
and translation invariance. 

If the clock register is set to operate, then a quantum gate 
is applied to the state register of the current cell (and possibly 
the state register of the upwards neighbor). For this, we fix a 
finite set of universal gates consisting of the controlled phase 
gate and some set of single-qubit operators. The choice of 
the controlled phase gate, as opposed to say controlled not, is 
to ensure that U x commutes with translations of itself. Any 
one-qubit unitary gates that form a universal set will work. 

If the clock register is set to carry, then the state register 
will be swapped with the state register of the left neighbor if 
and only if the following conditions occur: the active register 
is set to true on the left neighbor, and set to false on the current 
cell, and the clock register is set to carry on all the neighbors 
(above, below, and to either side). These extra checks are re- 
quired to ensure the operator U x commutes with translations 
of itself. 

Figure|5]gives a visual representation of the update rule op- 
erator U x . Operator V x simply updates the clock register, ap- 
plying a NOT gate at each time step. 

Finally, the initial state is set as follows. There is one hori- 
zontal row for each wire in the quantum circuit. Every column 
represents a time step in the quantum circuit. The cells are ini- 
tialized to have their gate registers set to the appropriate gate, 
if there is a gate, in the wire corresponding to its row, and in 
the time step corresponding to its column. The clock register 
is set to operate, and the state register is set |0) initially on all 
cells. The first column of the quantum circuit is set to active, 
all other cells are set to inactive. 

This construction can only natively simulate circuits with 
nearest-neighbor gates. In order to encode arbitrary circuits, it 
is necessary to translate the circuit into one using only nearest- 
neighbor gates by adding swap gates where needed. This is 
the cause of the worst-case linear slowdown, mentioned in the 
statement of this theorem. □ 
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FIG. 5: Universal QCA update rule. 



The previous result is important in that it proves that the 
QCA model is computationally complete. It also gives a 
recipe for implementing quantum circuit algorithms on two- 
dimensional QCA. It is important to mention that it is also 
possible to show that one-dimensional LUQCA are universal 
for quantum computing. For a complete proof see [26]. 

In the following sections, by showing how physical sys- 
tems can implement QCA, we complete a formula for imple- 
menting quantum algorithms on physical systems using QCA 
methods. We will see, however, that the strongest virtue of this 
QCA model lies not in its ability to simulate quantum circuits. 
Rather, it lies in the algorithms that take natural advantage of 
the QCA structure. 



V. MODELLING PHYSICAL SYSTEMS 

We stated before that one of our goals in developing a QCA 
formalism is to create a useful modeling tool for quantum sys- 
tems. Classical CA are used for simulating various phenom- 
ena based on classical information, such as sea ice formation, 
fluid dynamics, or voter systems yl |25[]. Similarly, we ex- 
pect QCA to be able to model different types of physical sys- 
tems based on quantum information, with dynamics which are 
based on time and space homogeneous local interactions. 

Physical systems that fall in this category include Ising 
and Heisenberg interaction spin chains, solid state NMR, and 
quantum lattice gases. We will be looking at some of these 
systems in this section. 
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A. Spin Chains 

Spin chains are perhaps the most obvious candidate for 
physical systems being modeled with QCA. Indeed, Ising in- 
teraction spin chains, and in general, any spin chain with a 
coupling Hamiltonian which commutes with its own lattice 
translations can be implemented easily. 

Suppose we have a linear spin chain of length N, indexed 
by n e Z. Each spin n is coupled to its nearest neighbor 
n + 1, with a coupling Hamiltonian Jcri™' ai n+1 \ where J is 
the coupling strength constant. Note that the coupling Hamil- 
tonian does commute with its lattice translations. The Hamil- 
tonian for the entire spin chain is: 

JV-l 
n=l 

It is a simple matter to give a discrete time approximation 
to such a spin chain. First, we fix a time step interval At. Our 
QCA model will allow for simulation of the spin chain for 
time steps in multiples of At. Hence, while the choice of At 
is arbitrary, it is important in determining the resolution of the 
simulation. 

For a simulation of the Ising spin chain, the QCA lattice 
consists of a one-dimensional array, where each cell is a single 
qubit. The neighborhood of each cell n simply consists of the 
cell and its right neighbor n + 1. The local rule operator U n 
is given as: 



Uai n) <rl n+1 > At 



The operator V n is simply the identity operator. Note 
that the operator U n commutes with its translations, that is, 
[U n ,U m ] = 0, for all n,m e Z. Furthermore, the global 
operator 



JV-l 



u=]]u n 



satisfies 



n=l 



_ —iHiAt 



U = e 



Hence, the QCA construction faithfully simulates the Ising 
spin chain for times that are integer multiples of At, as de- 
sired. 

A more complicated endeavor is to construct a QCA sim- 
ulation of a spin chain whose coupling Hamiltonians do not 
commute with each other. In particular we examine the 
Heisenberg spin chain as an example. Let the coupling Hamil- 
tonian between spins n and n + 1 be 



(n,n+l) 



H 



1) 



Here, note that H, 



(n,n+l) 
H 



does not commute with its trans- 



lations fj^ n - m+1 \ The Hamiltonian of the total system is 



JV-l 



(n,n+l) 



A QCA simulation of the Heisenberg spin chain presented 
above is still possible, however, with the help of two powerful 
tools: Trotterization, and cell coloring. The first technique 
is well known in physics, the second is a tool developed for 
QCA. Together, they allow for simulation of complicated and 
almost arbitrary Hamiltonians by QCA. 

Trotterization is a technique by which a Hamiltonian is ap- 
proximated using a combination of non-commuting Hamil- 
tonians whose sum adds up to the original Hamiltonian. In 
other words, it is possible to approximate with bounded error 
the evolution due to the Hamiltonian H = H a + Hb by al- 
ternately evolving the system under the Hamiltonians H a and 
Hb even if these two do not commute. Precisely, we can give 
a first-order approximation 



-i(H a + H b )At 



-iH a At/k -iH b At/k 



+ 5. 



In the case that \\[H a ,H b }\\ At 2 < 1, the error S is 
0(At 2 /k). Higher order techniques can achieve error rates 
of (9(At m + l/fc m ) at the cost of using 0(2 m ) gates. Though 
the number of gates increases exponentially, the time required 
for each gate decreases exponentially as well. 

In the case of our QCA simulation of the Heisenberg spin 
chain Hamiltonian Hu above, we have: 



E 

71=1 



H 



(2n-l,2n) 
H ' 



and 



E 

71=1 



H, 



(271,271+1) 



Note that Hn = H a + Hb- The Hamiltonians H a and 
Hb consist of the couplings from the even spins to their right 
neighbors and left neighbors respectively. 

Our QCA evolution will consist of alternately evolving the 
lattice cells under H a and Hb, using a technique called cell 
coloring. Each cell will have two fields. The first field is 
a state register, consisting of one qubit, which will hold the 
state of the spin represented by the cell. The second field, 
called the active color register, will also consist of a single 
qubit. Initially, the color register of each cell n is set to the 
value n mod 2. 

The QCA lattice used in this simulation is also one- 
dimensional, and the neighbor set of each cell includes both 
the cell to the immediate right, and the immediate left of the 
given cell. Let, U' n be the Trotter step acting on the current 
cell state register and the right neighbor state register. Using 
the first order approximation, we have 



UL =e 



-iH 



(n,n + l) 



At/fc 



71=1 



for an appropriate value k. It is also possible to use higher 
order approximations. 

The local update rule operator U n then consists of applying 
the operator U' n if and only if the current cell's active color 
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register is set to one, and both left and right neighbors have 
their color registers set to zero. The operator V n simply tog- 
gles the active color register. 

It should be clear that this QCA construction simulates the 
Heisenberg spin chain. Moreover, by using an appropriate 
operator U' n , it is possible to simulate any Hamiltonian with 
nearest neighbor couplings with this technique. 

It is appropriate here to mention that one-dimensional spin 
structures such as these can be efficiently simulated using 
classical computers 12711 . There are also efficient ways to 
calculate the lowest energy eigenstates and eigenvalues us- 
ing classical numerical techniques such as the Density Ma- 
trix Renormalization Group (DMRG) method l28ll . This, of 
course, also implies that the QCA presented in this section 
can be efficiently solved and simulated classically. 

However, this conclusion cannot be easily generalized to 
larger classes of QCA. First, we note that we have used 
one-dimensional spin networks here for expository purposes. 
From our constructions, it should be clear that these QCA sim- 
ulations generalize easily to higher dimensions. On the other 
hand, no efficient classical simulation is known for spin net- 
works of dimension higher than one. 

Also, it is not known whether arbitrary one-dimensional 
LUQCA can be simulated efficiently classically. In fact, due 
to the universality of one-dimensional LUQCA ll26ll . this will 
not be the case unless classical computers can efficiently sim- 
ulate quantum systems (BPP = BQP), which is generally 
regarded as unlikely. 
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FIG. 6: Quantum walk on a lattice. 



lattice can have any number of particles in total. 

The construction is as follows. The QCA Qw is one- 
dimensional. Each cell has two single-qubit registers, called 
Up and Down. Each register will represent the presence of a 
particle in the lattice site, with the appropriate spin, by being 
in the state |1), and the absence of the corresponding particle 
by being in the state |0). 

The local update operator U x acts on the down register of 
the current cell, and up register of the right neighbor, swap- 
ping the two values. Operator V operates on both fields of the 
cell with operator 



V x 



( 1 

p q 
\ c/) 



B. Quantum Lattice Gases 

Quantum lattice gases have been studied for over a decade 
now JMOIl^llSEllQ. In essence, they are the quantum 
analog of classical lattice gases. The basic principles are the 
same in both the classical and quantum cases: one starts with 
a discrete CA-based model that describes particles on the lat- 
tice, and their movement. One can then take the continuous 
limit of such CA and show that in this limit, the behavior of 
the CA mimics a well-known differential equation. 

Taking the continuous limit of a classical CA is a well 
known procedure. It involves giving the lattice a physical in- 
terpretation, where each cell is thought to represent a point in 
space. The distance between two adjacent cells is taken to be 
Aa; and each time step of the CA is assumed to take At time. 
One then takes the limit, in a well prescribed manner, where 
Ax — > and At — > 0. There exist classical CA whose con- 
tinuous limits represent gas diffusion, as well as various other 
fluid dynamics yfl. 

In the quantum case, Meyer and Boghosian and Taylor 
ifioll give a construction of a quantum lattice gas whose con- 
tinuous limit is the Schrodinger equation for a freely moving 
particle. We now show how any type of lattice gas can be 
represented under the local unitary QCA model. 

We begin by introducing the Quantum Walk QCA Qw- 
This QCA models multi-particle quantum walks on a lattice. 
Each cell is allowed to have two particles, in orthogonal states 
(these two states can be thought of as orthogonal spins). The 



where p, q G C satisfying \p\ 2 + \q\ 2 = 1, pq + pq = and 
\4>\ 2 = 1. The update rule is summarized in Figure[6] 

The dynamics of this QCA are the same as the lattice gas 
described by Boghosian and Taylor in 1 1 Ofl . 

Let ^ u (x,t) and ^d{ x > t) be the amplitude corresponding 
to the presence of a particle with spin up and spin down re- 
spectively in cell position x, at time t. Let ^(x,t) be the total 
amplitude corresponding to the presence of a particle in cell 
x at time t, that is * (x, t) = $> u (x, t) + ^a(x, t). Then, we 
have that 



$> u (x, t + At) = q^ u (x - Ax, t) + p^ d (x + Ax, t) 



and 



■$ d (x, t + At) = q^ d (x + Ax, t) + p^ u (x - Ax, t). 



We can proceed according to Boghosian and Taylor [10], 
and take the continuous limit of our QCA Ax 2 — » and 
At — ► 0, using the Chapman-Enskog method [3]. Doing so 
reveals that fy(x, t) obeys the equation: 

which is the equation for a freely moving particle of mass 
m = ip/qm one dimension. 

Using the same construction techniques, we can also de- 
scribe a freely moving particle in two or three dimensions. We 
can construct QCA that simulate other quantum lattice gases 
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like the ones proposed in [9]. Most, if not all, quantum lat- 
tice gases, whether single or multi-particle, can be described 
as local unitary QCA. 

This concludes our discussion on the expressive powers of 
the QCA model presented here. In the next section we con- 
tinue with a discussion of how to take these mathematical 
models and implement them in quantum hardware. 



VI. QUANTUM COMPUTATION 

In previous sections, we discussed how our unitary QCA 
can be used to model physical systems, and how universal 
quantum computation can be accomplished using only QCA 
primitives. In this section we will look into bridging the gap 
by using QCA as a theoretical framework for implementing 
quantum computation. 

A clear advantage of working in the QCA model over quan- 
tum circuits, in regards to physical implementations of quan- 
tum computation, is that QCA make considerably fewer de- 
mands on the underlying hardware. In particular, as opposed 
to direct implementations of quantum circuits, the global evo- 
lution of the lattice in the QCA model does not assume inde- 
pendent control over qudits. Rather, all qudits are to be ad- 
dressed collectively in parallel. However, it should be noted 
that the models of cellular automata described in this paper do 
not explicitly address the issue of initialization. Any proposed 
physical realization of the QCA must also describe the set of 
initial states which are constructible. This may require some 
degree of non-global control over a physical apparatus, such 
as having individual cells initialized to a certain basis state, or 
it may require some interaction with the environment, such as 
having the lattice cooled to a ground state. 

The QCA model also more closely resembles what is cur- 
rently achievable in several current quantum computer imple- 
mentations. For example, if qudits are represented by physi- 
cal spins, and the control of such spins is achieved through the 
use of magnetic pulses, as is the case in NMR or ESR, then 
it is more reasonable to consider all spins as being subjected 
to the same pulse sequences, rather than having the ability to 
address spins individually. The same can be said about many 
other physical quantum computer proposals. 

In this section we will concentrate on implementing QCA 
on NMR, since most of the groundwork for this implementa- 
tion has already been laid out. 



A. Colored QCA 

In Section W\ we considered cell coloring as a useful QCA 
programming technique. As with other computation models, 
where a programming technique can be formalized into its 
own subset model and then shown to be equivalent to the gen- 
eral model (such as multi-track Turing machines), we can do 
the same with colored QCA. 

First, we will define the notion of a symmetric transition 
function for QCA. It is the quantum analog of symmetric 



CA, in which the transition function depends only on the to- 
tal number of neighboring cells in each of the possible cell 
states. Essentially, a transition update function is symmetric 
when it affects only the value of the target cell in a manner 
which depends only on how many of the cell's neighbors are 
in particular states, rather than on which state any particular 
neighbor is in. 

Definition 5. Given a QCA Q — (L, E, jV, Uo, Vq), we call 
the update function Uo : (Ws)®^ — > ('He)®'^ symmetric if 
it can be expressed as a collection of single-cell operations 
on cell controlled by the computational basis states of the 
neighborhood Af\ {0}, and Uo commutes with every operator 
SWAP Xi y, which simply swaps the contents of cells x and y, 
where x,y G Af \ {0}. If Q has a symmetric update function, 
then we call Q a symmetric QCA. 

Next, we wish to formalize the notion of a colored QCA. 
For this model, we will fix the neighborhood scheme to in- 
clude only directly adjacent cells. That is, JV = {x € Z d : 
\\ x \\i < !}■ However, first we will define the set of permissi- 
ble colorings of a lattice. 

Definition 6. Given a lattice L = 7L d and a neighborhood 
scheme M , we define a correct k-coloring for a lattice as a 
periodic mapping C : L — > {0, 1, . . . , k — 1}, such that no 
two neighboring cells in L are assigned the same color. 

We may think of cell color as an inherent property of each 
cell. However, it may also be helpful to consider cell color 
as classical information which is being stored with each cell 
in such a way that the local transition function does not al- 
ter this information. We can now finally give a definition for 
the colored QCA. Recall that the neighborhood scheme Af is 
fixed. 

Definition 7 (CQCA). A Colored QCA or CQCA is a 5-tuple 
(L,C,"E,U,c) consisting of a lattice L = Z d , a correct k- 
coloring C, a finite set E of cell states, a sequence of T 

symmetric unitary operators Li — (u^\ U^\ . . . , Uq T 1 M, 

withU^ : (Hs)®"^^ (Hs)®^ , and a sequence of T colors 
c = (co, C\, . . . , C(r-i)), labeled by integers < Cj < k. 

(i) 

The local transition operation consists of applying U x to 
each cell x with color C (x) — Cj at time step t = j + nT, 
where < j < T and n G Z. 

Note that since C is a correct fc-coloring, any two operators 
Ux acting non-trivially on two cells of the same color at the 
same time will commute. 

CQCA can be simply considered as a shorthand for the cell 
coloring technique we introduced in Section [V] As such, it 
should be clear that CQCA are a subset of unitary QCA. 

Theorem 3. For every CQCA Q there is a QCA Q' that sim- 
ulates the same evolution exactly. 

Proof. We may incorporate the color information of each cell 
of the CQCA Q within an additional color register for each 
cell of the QCA Q'. Now, it suffices to add one extra clock 
register to each cell, initialized to 0. The update operator U x 
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simply applies Ux conditional on both C(x) and the clock 
register of cell x being set to j. In order to ensure that U x 
commutes with its translations, we must ensure that the colors 
of all the neighbors of x are consistent with the coloring C be- 
fore applying the appropriate operator. Otherwise, U x should 
act as the identity operator. The read operator V x simply in- 
crements the clock register, modulo T. □ 

What is more surprising is the converse result: that all uni- 
tary QCA can be rephrased in the CQCA formalism. 

Theorem 4. For every QCA Q there is a CQCA Q' that sim- 
ulates the same evolution exactly. 

Proof. Given the QCA Q = (L, E,7V, U , V ), we will use 
the same lattice L and alphabet S. The neighborhood scheme 
for the CQCA, M 1 is fixed by definition. We also need to 
provide a correct fc-coloring of the lattice. To this end, it suf- 
fices to provide a coloring C with the property that that no 
neighborhood M x of Q or J\f' x of Q 1 contains two cells with 
the same color. Now, we need to construct a sequence U of 
update operators. Note that single-qudit operations on x and 
the controlled- NOT operation targeting x are symmetric op- 
erations, since any two cells belonging to the same neighbor- 
hood have different colors, by construction. Now, given an 
implementation of the unitary update operation Uo of Q us- 
ing single-qudit and nearest-neighbor controlled- NOT oper- 
ations, we can give a sequence of symmetric operations which 
perform (7o on a neighborhood M x of a cell a; of a specific 
color. By performing a similar sequence of operations for 
each color in our coloring C, we effectively perform U x for 
each cell x. Since each update operation U x commutes with 
the other update operations, we have effectively simulated the 
update transition operation of Q. Finally, we can perform the 
single-qudit operations V x on each cell. □ 

This last result is of major importance as it allows us to im- 
plement any unitary QCA algorithm on a pulse-driven quan- 
tum computer, as proposed by Lloyd fl aj . and further devel- 
oped by Benjamin IU4. [1511 and others 13311 . The scheme in- 
volves using large molecules comprised of two or more dif- 
ferent species of spin-i particles, arranged in repetitive struc- 
tures, such as crystals or polymers, to store the quantum data. 
It then evolves the system using series of magnetic pulses that 
address all spins of any one particular species. 

To implement a given QCA in the pulse-driven computation 
model, we first convert the QCA into one which uses a two- 
state alphabet. This can be done by expanding each cell into 
[log | £ | ] cells to encode the states of X with a binary alphabet, 
then adjusting the neighborhood scheme M accordingly. We 
then apply the construction in Theorem|4] With this, and the 
techniques of Lloyd et. ai, it would be possible to implement 
any QCA algorithm using NMR and an appropriate molecule. 

We choose NMR and pulse driven quantum computing 
devices to show a physical implementation of local unitary 
QCA. However, this should not be taken to be the only pos- 
sible implementation of QCA. There are many other physical 
systems, like optical lattices 13411 . cavity QED, among others 
051 13611 . that seem better suited to implementing QCA, rather 
than the more traditional quantum circuits. 



VII. ALGORITHMS 

We have seen two practical applications that can be 
achieved with an implementation of QCA in the laboratory. 
First, there are numerous physical systems that can be natu- 
rally simulated using the QCA model. Second, one can also 
achieve universal quantum computation by simulating quan- 
tum circuits on a QCA. 

While these are both interesting and important applications 
of QCA, a very important application in the future of QCA 
will be the development and implementation of true, native, 
QCA algorithms. 

We saw in Section [TVl how a quantum circuit can simulate 
any QCA, and how a QCA can simulate any quantum circuit. 
However, these simulations come at a cost of a linear-time 
slowdown going in either direction. While this slowdown is 
not as important a concern in terms of asymptotic complexity, 
in current laboratory conditions, any source of slowdown is to 
be avoided. 

In the next section we analyze a problem that is particularly 
well-suited to a natural solution using QCA, and we show how 
the tools that we have developed thus far can be used effec- 
tively to provide an optimal solution to the problem. 

A. Spin Signal Amplification Algorithm 

We present a description of the problem in simple abstract 
terms. Suppose we want to amplify the signal from a single 
spin-i particle. That is, we have a single spin-^ particle, and 
we want to create a large ensemble of spins whose bulk an- 
gular momentum resembles the original spin in a particular 
basis. Note that this is not cloning, since a basis needs to be 
set beforehand. Succinctly, we want a unitary procedure U 
that maps the state 

(a 10) +/?!!)) ® \0f N 
Amplified Spin Ancilla 

to the state 

q|0> ®(jv+i) +/3}1) ® C iv + i) ! 

where |0) and |1) form the basis in which we wish to am- 
plify. The main application of such an algorithm is to per- 
form a measurement in situations where bulk magnetization 
is needed in order to achieve a detectable signal, such as with 
NMR. Hence, the algorithm needs to be extremely efficient: 
the whole procedure needs to be completed before decoher- 
ence can destroy the desired value. The value N will also 
need to be reasonably large, on the order of 10 7 or 10 8 , in 
order to get a reasonable signal in NMR. 

Figure [7] shows a simple quantum circuit solution. How- 
ever, this circuit approach does have several shortcomings. 
First and foremost, it requires individually addressing dif- 
ferent spins. For large N, in most laboratory conditions, this 
is not feasible. Supposing that one could get around this first 
hurdle, one would still need to perform N independent gates 
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FIG. 7: A simple quantum circuit that implements U. 

before decoherence destroys the data. Again, this is not likely 
to be feasible in most experimental settings. 

The QCA approach is simple, elegant, and optimally effi- 
cient. In order to develop the algorithm we will make use of 
the colored QCA developed earlier. We will use a two-color 
(black and white), three-dimensional QCA. Since we are de- 
scribing an algorithm that has to be implemented in an actual 
physical setting, we will be using a finite-sized workspace. 
In order to describe this workspace, we will use three-state 
qudits for our cells: the logical states |+1) and | — 1) will be 
used to denote the presence and the spin of a spin-i particle 
in the corresponding state, while the state |0) will denote the 
absence of any particle. This state |0) will be quiescent, as 
defined in SectionHIH 

Every cell in the QCA is initialized to a state |0) except for 
a perfectly cubic region of volume roughly 2N. The cube will 
have its cells initialized to the value |— 1), except for the top- 
front-left corner of the cube, whose value will be initialized to 
the state \ip) which we wish to amplify. We will use this top, 
front, left portion of the cube as an ancilla system. 

As we are using a colored QCA, the neighborhood of each 
cell is fixed to be the set of cells with Manhattan distance 1 
from that cell. Hence, each cell has only neighbors of the op- 
posing color. We also need to provide an update rule which 
is color-symmetric. For both colors, the update rule is as fol- 
lows. We apply a NOT gate (which maps |+1) to |— 1), | — 1) 
to |+1) and leaves the quiescent state |0) untouched) if and 
only if the set of neighbors of a cell have values which sum 
to —2, —1, or 0. It can be shown that this update rule, when 

applied repeatedly for O (^NJ time steps, will achieve the 

desired result. 

We can make a few simple observations about the algo- 
rithm. First, as a native QCA algorithm, it does not require 
individual spin addressability. The algorithm is optimally ef- 
ficient, if we allow only the use of local operations, in at most 
three dimensions. 

It should also be noted that the problem of single-spin mea- 
surement in NMR is generally considered to be a difficult one; 
the fact that the exposition of the algorithm presented here is 
simple and succinct is due to the development of the theoreti- 
cal tools earlier in this work. 

It is important to add that it is possible to implement this 
algorithm in solid state NMR by adapting some of the tech- 
niques presented above, and applying some clever manipu- 



lations. For a full description of this algorithm, including a 
discussion on physical implementation see ll2lll2a, l37ll . 

VIII. PREVIOUS QCA MODELS 

In this section, we will present a number of other models of 
QCA that have been developed, and we will relate them to our 
proposed model. 

A. Watrous-van Dam QCA 

The first attempt to define a quantized version of cellular 
automata was made by Watrous [6], whose ideas were further 
explored by van Dam |5|], and by Diirr, LeThanh and Santha 
ll38i l39tl . The model considers a one-dimensional lattice of 
cells and a finite set of basis states E for each individual cell, 
and features a transition function which maps a neighborhood 
of cells to a single quantum state instantaneously and simulta- 
neously. Watrous also introduces a model of partitioned QCA 
in which each cell contains a triplet of quantum states, and a 
permutation is applied to each cell neighborhood before the 
transition function is applied. 

Definition 8. A Watrous-van Dam QCA, acting on a one- 
dimensional lattice indexed by Z, consists of a 3-tuple 
(E,A/" ; /) consisting of a finite set E of cell states, a fi- 
nite neighborhood scheme N ', and a local transition function 
f : E^ -» He. 

This model can be viewed as a direct quantization of the 
classical cellular automata model, where the set of possible 
configurations of the CA is extended to include all linear su- 
perpositions of the classical cell configurations, and the local 
transition function now maps the cell configurations of a given 
neighborhood to a quantum state. In the case that a neighbor- 
hood is in a linear superposition of configurations, / simply 
acts linearly. Also note that in this model, at each time step, 
each cell is updated with its new value simultaneously, as in 
the classical model. 

Unfortunately, this definition allows for non-physical be- 
havior. It is possible to define transition functions which do 
not represent unitary evolution of the cell tape, either by pro- 
ducing superpositions of configurations which do not have 
norm 1, or by inducing a global transition function which is 
not injective, and therefore not unitary. In order to help re- 
solve this problem, Watrous restricts the set of permissible 
local transition functions by introducing the notion of well- 
formed QCA. A local transition function is well-formed sim- 
ply if it maps any configuration to a properly normalized lin- 
ear superposition of configurations. Because the set of config- 
urations is infinite, this condition is usually expressed in terms 
of the £2 norm of the complex amplitudes associated with each 
configuration. 

In order to describe QCA which perform unitary evolution, 
Watrous also introduces the idea of a quiescent state, which 
is a distinguished element e e E which has the property that 
/ : i— > . We can then define a quiescent QCA as a 
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FIG. 8: Watrous Partitioned QCA. 
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QCA with a distinguished quiescent state acting only on finite 
configurations, which consist of finitely many non-quiescent 
states. It can be shown that a quiescent QCA which is well- 
formed and injective represents unitary evolution on the lat- 
tice. Also, note that this notion of a quiescent state is slightly 
different than the one introduced in Section Hill 

In order to construct examples of valid QCA in this model, 
Watrous also introduces a model of partitioned QCA, in which 
each cell consists of three quantum states, so that the set of 
finite states can be subdivided as S = S; x S c x S r . Given 
a configuration in which each cell, indexed by k S Z, is in 

the state (qjl? ,qj?\qjp), the transition function of the QCA 
in one time step first consists of a permutation which brings 

the state of cell k to {qjj_i, q k c \ Q^+i) for each k G Z, then 
performs a local unitary operation Vk on each cell. 

Watrous shows that this model of partitioned QCA can be 
used to simulate a universal quantum Turing machine with 
polynomial overhead. 

The partitioned QCA model given by Watrous can also be 
expressed in the Local Unitary QCA model. First, suppose 
|Ej| = |S r |. If this is not the case, we can pad the smaller 
set with unused symbols so that both sets are of the same size. 
Then, we separate the permutation into an operation Pi which 
operates on two consecutive cells, mapping 

p • r„W J c i n^h (JQ J c i „M ^ 
n- [Q k -Ik ^1k idQk+i'Qk+vlk+i) 

h-> (n (l) n {c) a [l) \ (n (r) a (c) n {r) \ 

followed by an operation Pi which operates on a single cell, 
mapping 

^2 ■ (q k ,q k ,% ) H \% '1 k 1% )■ 

Note that P^Pi performs the desired permutation, and also 
that Pi commutes with any lattice translation of Pi . Now, we 
can express the Watrous partitioned QCA in our QCA model 
by setting U' = Pi and V' = VP2, as shown in Figure [9] 



FIG. 9: Watrous QCA expressed as a Local Unitary QCA. 



B. Schumacher- Werner QCA 

Schumacher and Werner |4] take a different approach in 
the definition of their model of QCA, working in the Heisen- 
berg picture rather than the Schrodinger picture. They intro- 
duce a comprehensive model of QCA in which they consider 
only the evolution of the algebra of observables on the lat- 
tice, rather than states of the cell lattice itself. By extending 
local observables of the cell lattice into a closed observable 
algebra, the Schumacher- Werner model has a number of use- 
ful algebraic properties. In this model, the transition function 
is simply a homomorphism of the observable algebra which 
satisfies a locality condition. Schumacher and Werner also in- 
troduce a model of partitioned QCA called the Generalized 
Margolus Partitioned QCA, in which the observable algebra 
is partitioned into subalgebras. This generalizes the Margolus 
scheme, as described in Section |IIJ in which the cell lattice 
itself is partitioned. 

In order to avoid problematic issues dealing with observ- 
ables over infinite lattices, Schumacher and Werner make use 
of the quasi-local algebra. In order to construct this algebra, 
we first start with the set of all observables on finite subsets 
S C L of the lattice, denoted A(S), and extend them appro- 
priately into observables of the entire lattice by taking a tensor 
product with the identity operator over the rest of the lattice. 
The completion of this set forms the quasi-local algebra. 

In this setting, the global transition operator of a QCA is 
simply defined as a homomorphism T : A(L) — > A(L) over 
the quasi-local algebra which satisfies two specific proper- 
ties. First, a locality condition must be satisfied: T(A(S)) C 
A(S + A/") for all finite SQL. Secondly, T must commute 
with lattice translation operators, so that the QCA is space- 
homogeneous. Now, the QCA can be defined in terms of the 
lattice L, the neighborhood scheme Af, the single-cell observ- 
able algebra, Ao, which takes the place of the alphabet, and 
the global transition operator T. 

The local transition operator of a QCA is simply a homo- 
morphism Tq : Ao —> A(Af) from the observable algebra of 
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a single distinguished cell S L to the observable algebra of 
the neighborhood of that cell. Schumacher and Werner show 
that a local homomorphism Tq will correspond uniquely to a 
global transition operator T if and only if for each x € L, 
the algebras Tq(Ao) and T x (To (Ao ) ) commute elementwise. 
Here, t x is a lattice translation by x. The global transition 
operator T given by To is defined by 

T(A(S)) = J] Tx{A x ). 

Next, we will describe the Generalized Margolus Parti- 
tioned QCA. Schumacher and Werner present this partitioned 
scheme as a method of producing valid reversible QCA in 
their general model. In order to describe this scheme, we will 
proceed according to the definition of a classical partitioned 
CA, as given in Section HT1 

We start with the <i-dimensional lattice L = Z d , and we fix 
the sublattice S = 2Z d as the set of cells of L with all even 
co-ordinates. We also fix the time period as T — 2. The block 
scheme, B is given as {Bq, Bi}, which is given as 

Bo = {(ici,af2, ...,Xd) e L : < Xj < 1,1 < j < d}, 

which is simply a cube of size 2 d with corners at cells = 
(0,0, . . . ,0) and 1 = (1, 1, . . . , 1), and 

B 1= B a + 1, 

which is simply a translation of the cube Bq. 

Now, as in the regular Schumacher- Werner QCA model, we 
proceed in the Heisenberg picture. For any block Bq + s, s € 
S, we have 2 d intersecting blocks from the partition B\ + S. 
For each block B\ + s' which intersects with Bq + s, there 
is a vector v G 7L d representing the translation taking Bq + s 
to B x + s', so that B x + s' = B Q + s + v. Indeed, these 2 d 
intersecting blocks may be indexed by the vectors v, which are 
simply all vectors of Z d whose entries are each ±1. Hence, 

(s) 

we will set By = B + s + v. 

For each block B v °\ we will fix an observable algebra B^ 
as a subalgebra of the observable algebra A(B^ ) for the en- 
tire block. Then, for each block , the observable algebra 
is simply the appropriate translation of Note that, in 
particular, the observable algebra for the block Bi+s = B\ 

A(Bi ), contains each of the observable algebras b{? +1 . 
In order for an assignment of subalgebras to be considered 

valid, these subalgebras i3« s+1 _ ^ must commute and span 
A(B^ ). This occurs if and only if the product of the dimen- 
sions of these algebras is |E| 2 . 

The transition function then consists first of an isomor- 
phism 

T^:A(B^)^]jB^\ 

V 

followed by the isomorphism 

Ti s) ■ l[Bi s+1 - v ^A(B^y 



Note that since To and T\ are isomorphisms between observ- 
able algebras of equal dimension, with an appropriate choice 
of basis, they can be represented by unitary operators f/o and 
U\ which map vectors from a complex vector space to another 
complex vector space of equal dimension. However, they do 
not represent local unitary evolution, since these complex vec- 
tor spaces are used to describe two different quantum systems. 
For example, the Shift-Right QCA, which was shown in Sec- 
tion||ll]to be impossible to implement using local unitary op- 
erators, can be constructed in the Generalized Margolus Parti- 
tioning QCA model. 

Fortunately, it is possible to simulate the Generalized Mar- 
golus Partitioning QCA model within the Local Unitary QCA 
model by adding 2 d memory registers to each cell correspond- 
ing to the subalgebras B v in addition to a clock register indi- 
cating which of the two stages of the transition function is 
being performed. The transition function of the Local Uni- 
tary QCA simply swaps the contents of the data registers of 
each cell with the appropriate memory registers before apply- 
ing the unitary operations corresponding to the desired iso- 
morphisms. 



C. Other Models 

Meyer [ ioll . Boghosian and Taylor IToll . Love and 
Boghosian [31], among others explored the idea of using QCA 
as a model for simulating quantum lattice gases. As classical 
CA are used to model classical physical systems, it is natu- 
ral to develop QCA models which are capable of modelling 
quantum physical systems. In order to simulate lattice gases, 
Meyer uses a model of QCA in which each lattice cell is rep- 
resented by a computational basis state in a Hilbert space, and 
the set of states which a given cell can take is replaced with a 
complex number representing the amplitude of the basis state 
corresponding to that cell. In this regard, Meyer's QCA mod- 
elling of lattice gases greatly differs from the one presented 
here, and is not suitable as the basis for a more general model 
of QCA. 

Lloyd d introduced a model of physical computation 
based on a chain consisting of a repeating sequence of a fixed 
number of distinguishable nuclear species. In this model, 
pulses are programmed which are capable of distinguish- 
ing the species and performing nearest-neighbor unitary op- 
erations. This model has been further developed by others 
Ill4l4lll42ll . It has been shown that this model is sufficient for 
implementing universal quantum computation. 

The model, sometimes referred to as 'pulse-driven quan- 
tum computers', or Globally Controlled Quantum Arrays 
(GCQA), is different from QCA in that it allows for time- 
dependent evolution. Still, they are closely related in their 
use of only space-homogeneous update rules. For the sake of 
applying results pertaining to one model to the other, it is also 
possible to argue that a pulse-driven quantum computer is a 
degenerate case of a QCA where the update rule is applied 
once. Also, this physical scheme provides a natural platform 
for implementing QCA. 
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D. Comparison and Discussion 

In this section we have gone through a brief discussion of 
the major QCA models already in the literature today. While 
each has its own strengths, it is also true that each has weak- 
nesses that are addressed by the LUQCA model. 

We have seen how general Watrous-van Dam QCA have 
the problem of permitting ill-defined QCA. It is tempting to 
simply restrict attention to well-defined QCA in these models. 
However, deciding whether a given QCA is well-defined or 
not is a hard problem. 

Even when a QCA is guaranteed to be well defined, as is 
the case for the partitioned Watrous-Van dam QCA, these al- 
low for evolution that is not unitary and local, e.g. shift-right 
automata as described in Section IIIIBI This same criticism 
applies to the Schumacher- Werner model of QCA. 

Fortunately, it is possible to simulate any valid QCA in 
these models with local unitary QCA by adding ancillary 
space to each cell, in order to perform the necessary evolu- 
tion in a unitary fashion. 

Meyer's definition of QCA, while being suitable for his pur- 
poses, is not general enough to allow for all the behavior that 
is possible with local unitary QCA, i.e. universal computa- 
tion. Again, QCA in this model can easily be simulated by 
LUQCA. 

Finally, we address globally-controlled quantum arrays. 
There are many similarities and connections between QCA 
and this model of computation. One important connection 
is how globally-controlled arrays can be used to implement 
QCA. However, these two models are quite distinct. A GCQA 
is centered around the idea of doing computation on large ar- 
rays of simple quantum systems, without locally addressing 
them. GCQA divide their lattice of cells, or qudits, into sub- 
sets each of which can be addressed collectively. 

The first major distinction with QCA comes from the fact 
that sequences of pulses applied to these subsets of qudits are 
arbitrary, and do not necessarily follow a time-homogenous 
pattern. The second, is that although Lloyd's construction is 
space homogenous, GCQA are not constrained in such a fash- 
ion. More recently, GCQA have been proposed that have less 
spatially homogenous structures l43ll . 

As a model of computation one can say that QCA are more 
restricted than GCQA. At the same time, QCA are more than 
just a model of computation; they serve also as models of 
physical phenomena. It can be argued that QCA are, in a 
sense, a mart fundamental construct. 



IX. CONCLUSIONS 

In this paper, we have presented a model of quantum cellu- 
lar automata based on local unitary operators. We have shown 
that it has distinct advantages over previous cellular automata 
quantizations. In particular, we have shown that given any 
LUQCA it is always possible to give an efficient, low-depth 
quantum circuit that faithfully represents it. This leads to the 
conclusion that any universal quantum computer could imple- 
ment an LUQCA efficiently. 

More importantly, however, we have also shown that it is 
possible to implement LUQCA in experimental setups that 
are arguably simpler than traditional quantum circuit based 
algorithms: for instance, globally addressing spins in NMR 
or ESR. At the same time, we have shown that our model 
is universal for quantum computation. We gave an explicit 
proof of an efficient simulation of quantum circuits using a 
two-dimensional QCA and mentioned that a proof of univer- 
sality for one-dimensional LUQCA exists as well M26I1 . We 
also showcased the LUQCA as a modelling and simulation 
tool. 

Finally, we have shown that QCA in previous models can 
be efficiently translated into QCA within the model presented 
here. For example, a universal QCA in previous models 
[6, 18] can easily become a universal QCA within the local 
unitary model (see l26ll for an explicit construction). All of 
these facts suggest that the LUQCA is a very strong model. 

The purpose of this paper has been to motivate, develop 
and showcase a model of quantum cellular automata based 
on strictly local, translation-commuting, unitary operators. It 
is our conjecture that the construction given here is the most 
general of this form. 

Ultimately, it is our hope that this paper serves to help unify 
the several methods, results, and views surrounding QCA into 
one single, cohesive paradigm. 
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